Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity (Supplemental Material)

Chih-Hsuan Yang

Neural Information Processing Systems

Arboretum is a 134.6M-sample multimodal dataset designed to advance AI for biodiversity applications by pairing images with accurate corresponding annotations at large scale. Arboretum aims to facilitate the development of AI models for species identification, ecological monitoring, and agricultural research. The dataset is hosted on Hugging Face and will remain available for as long as the iNaturalist Open Dataset is maintained.